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Abstract 



The constraint satisfaction problem (CSP) is a general problem central to com- 
puter science and artificial intelligence. Although the CSP is NP-hard in general, 
I considerable effort has been spent on identifying tractable subclasses. The main 

■ two approaches consider structural properties (restrictions on the hypergraph of con- 

straint scopes) and relational properties (restrictions on the language of constraint 
relations). Recently, some authors have considered /ii/6rid properties that restrict the 
constraint hypergraph and the relations simultaneously. 
^ . Our key contribution is the novel concept of a CSP pattern and classes of prob- 

I lems defined by forbidden patterns (which can be viewed as forbidding generic sub- 

problems). We describe the theoretical framework which can be used to reason about 
classes of problems defined by forbidden patterns. We show that this framework gen- 
eralises relational properties and allows us to capture known hybrid tractable classes. 

Although we are not close to obtaining a dichotomy concerning the tractability 
of general forbidden patterns, we are able to make some progress in a special case: 
classes of problems that arise when we can only forbid binary negative patterns 
(generic sub-problems in which only inconsistent tuples are specified). In this case 
we are able to characterise very large classes of tractable and NP-hard forbidden 
patterns. This leaves the complexity of just one case unresolved and we conjecture 
that this last case is tractable. 

Keywords: Constraint satisfaction problem, tractability, forbidden substructures. 
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1 Introduction 



In the constraint satisfaction paradigm we consider computational problems in which we 
have to assign values (from a domain) to variables, under some constraints. Each constraint 
limits the (simultaneous) values that a list of variables (its scope) can be assigned. In a 
typical situation some pair of variables might represent the starting times of two jobs in a 
machine shop scheduling problem. A reasonable constraint would require a minimum time 
gap between the values assigned to these two variables. 

Constraint satisfaction has proved to be a useful modelling tool in a variety of contexts, 
such as scheduling, timetabling, planning, bio-informatics and computer vision. This has 
led to the development of a number of successful constraint solvers. Unfortunately, solving 
general constraint satisfaction problem (CSP) instances is NP-hard and so there has been 
significant research effort into finding tractable fragments of the CSP. 

In principle we can stratify the CSP in two quite distinct and natural ways. The 
structure of the constraint scopes of an instance of the CSP can be thought of as a hyper- 
graph where the variables are the vertices, or more generally as a relational structure. We 
can find tractable classes by restricting this relational structure, while allowing arbitrary 
constraints on the resulting scopes [10]. Sub-problems of the general constraint problem 
obtained by such restrictions are called structural. Alternatively, the set of allowed assign- 
ments to the variables in the scope can be seen as a relation. We can choose to allow only 
specified kinds of constraint relations, but allow these to interact in an arbitrary structure 
[20] . Such restrictions are called relational or language-based. 

Structural subclasses are defined by specifying a set of hypergraphs (or relational struc- 
tures) which are the allowed structures for CSP instances. It has been shown that tractable 
structural classes are characterised by limiting appropriate (structural) width measures 
[TT| [T3] [T9] [T5| 123] . For example, a tractable structural class of binary CSPs is obtained 
whenever we restrict the constraint structure (which is a graph in this case) to have bounded 
tree width [Til [13]. fact, it has been shown that, subject to certain complexity-theoretic 
assumptions, the only structures which give rise to tractable CSPs are those with bounded 
(hyper-)tree width [21 [HI dSl [23] . 

Relational subclasses are defined by specifying a set of constraint relations. It has been 
shown that the complexity of the subclass arising from any such restriction is precisely 
determined by the so called polymorphisms of the set of relations [U H] . The polymorphisms 
specify that, whenever some set of tuples is in a constraint relation, then it cannot be 
the case that a particular tuple (the result of applying the polymorphism) is not in the 
constraint relation. It is thus the relationship between allowed tuples and disallowed tuples 
inside the constraint relations that is of key importance to the relational tractability of 
any given class of instances. Whilst a general dichotomy has not yet been proven for the 
relational case, many dichotomies on sub-problems have been obtained, e.g. [21 [H [3]. 

Unfortunately, by allowing only structural or relational restrictions we limit the possible 
subclasses that we can define. By considering restrictions on both the structure of the 
constraint graph and the relations, we are able to identify new tractable classes. We call 
these restrictions hybrid reasons for tractability. 
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Several hybrid results have been published for binary CSPs [221 El 1211 El IZ] • Instead of 
looking at the constraint graph or the constraint language, these works captured tractabil- 
ity based upon the properties of the (coloured) microstructure of CSP instances. The 
micro structure of a binary CSP instance is the graph (y, E) where V is the set of pos- 
sible assignments of values to variables and E is the set of pairs of mutually consistent 
variable- value assignments [22j. In the coloured microstructure, the vertices representing 
an assignment to variable Vi are labelled by a colour representing variable Vj, thus main- 
taining the distinction between assignments to different variables. 

The coloured microstructure of a CSP instance captures both the structure and the 
relations of a CSP instance and so it is a natural place to look for tractable classes which 
are neither purely structural nor purely relational. 

Of the results on (coloured) microstructure properties, two are of particular note. First 
it was observed that the class of instances with a perfect microstructure is tractable |28j . 
This is a proper generalisation of the well known hybrid tractable CSP class whose instances 
allow arbitrary unary constraints and in which every pair of variables is constrained to be 
not equal p3| 129] . and of the hybrid class whose microstructure is triangulated [22| |5]. 
The perfect microstructure property excludes an infinite set of induced subgraphs from 
the microstructure. In this paper, we provide a different hybrid class that also strictly 
generalises the class of CSP instances with an inequality constraint between every pair 
of variables and an arbitrary set of unary constraints, but does so by forbidding a single 
pattern. 

Secondly, the so called broken-triangle property properly extends the structural notion 
of acyclicity to a more interesting hybrid class [6]. The broken triangle property is spec- 
ified by excluding a particular pattern (a subgraph) in the coloured microstructure. It is 
this property that we generalise in this paper. We do this by working directly with the 
CSP instance (or equivalently its coloured microstructure) rather than its microstructure 
abstraction which is a simple graph. This allows us to introduce a language for expressing 
hybrid classes in terms of forbidden patterns, so obtaining novel hybrid tractable classes. 
In the case of binary negative patterns we are able to characterise very large classes of 
tractable and NP-hard forbidden patterns. This leaves the complexity of just one case 
unresolved and we conjecture that this last case is tractable, which would give us a new 
CSP dichotomy for hybrid classes of binary CSPs defined by negative patterns. 

Contributions 

In this paper we generalise the definition of a CSP instance to that of a CSP pattern 
which has two types of tuple in its constraint relations, tuples which are explicitly al- 
lowed/disallowed and tuples which are labelled as unknow^ By defining a natural notion 
of containment of patterns in a CSP, we are able to describe problems defined by for- 
bidden patterns: a CSP instance P forbids a particular pattern if this pattern cannot be 
contained in P. By defining problems in this way, we can capture both allowed and disal- 

^This can be viewed as the natural generalisation of the CSP to a three- valued logic. 
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lowed constraint tuples as well as structural properties. We use this framework to capture 
tractability by identifying local patterns of allowed and disallowed tuples (within small 
groups of connected constraints) whose absence is enough to guarantee tractability. 

By extending the notion of the CSP instance to that of a pattern we are able to unify 
the following properties: 

• having a particular polymorphism; 

• having a hereditary (coloured) microstructure property, such as broken triangle [6]; 
and 

• having a tree structure (tree width 1). 

Using the concept of forbidden patterns, we lay foundations for a theory that can be 
used to reason about classes of CSPs defined by hybrid properties. Since this is the first 
work of this kind, we primarily focus on the simplest case: binary patterns in which tuples 
are either disallowed or unknown (called negative patterns). We give a large class of binary 
negative patterns which give rise to intractable classes of problems and, using this, show 
that any negative pattern that defines a tractable class of problems must have a certain 
structure. We are able to prove that this structure is nearly enough to guarantee tractability 
and conjecture that there is a precise condition providing dichotomy for tractability defined 
by forbidding binary negative patterns. Importantly, our intractability results also allow 
us to give a necessary condition on the form of general tractable patterns. 

The remainder of the paper is structured as follows. In Section |2] we define constraint 
satisfaction problems, and give other definitions which will be used in the paper. Then, in 
Section |3l we define the notion of a CSP pattern and describe classes of problems defined 
by forbidden patterns. In Section |4] we show that one must take the size of patterns into 
account to have a notion of maximal classes defined by forbidding patterns. In Section |5] we 
give some examples of tractable classes defined by forbidden patterns on three variables. 
In general, we are not yet able to make any conjecture concerning a dichotomy for hybrid 
tractability defined by general forbidden patterns. However, in Section we are able to give 
a necessary condition for such a class to be tractable. Finally, in Section [7] we summarise 
the results of this work and discuss directions for future research. 

2 Preliminaries 

Definition 2.1. A CSP instance is a triple {V,C,D) where 

• V is a finite set of variables (with n = \V\). 

• D is a finite set called the domain (with d = \D\). 

• C is a set of constraints. Each constraint c & C is a pair c = {a, p) where 

— a is a list of variables called the scope of c. 
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— p is a relation over D of arity \a\ called the relation of c. It is the set of tuples 
allowed by c. 

A solution to the CSP instance P — (V, D, C) is a mapping s -.V ^ D where, for each 
{(T-iP) & C we have s{a) e p (where s{a) represents the tuple resulting from the application 
of s component- wise to the list of variables a). 

The arity of a CSP is the largest arity of any of its constraint scopes. Our long-term 
aim is to identify all tractable subclasses of the CSP problem which can be detected in 
polynomial time. In this paper we describe a general theory of forbidden patterns for 
arbitrary arity but only consider the implications of the new theory for tractable classes 
of arity two (binary) problems specified by finite sets of forbidden patterns. In such cases 
we are certain that class membership can be decided in polynomial time. 

The CSP decision problem, which asks whether a particular CSP instance has a solution, 
is already NP-complete for binary CSPs. For example, there is a straightforward reduction 
from graph colouring to this problem in which vertices i of the graph map to CSP variables 
Vi and edges map to disequality constraints Vi ^ Vj. 

It will sometimes be convenient in this paper to use an equivalent functional formulation 
of a constraint. In this alternative formulation the scope a of the constraint {a, p) is 
abstracted to a set of variables and each possible assignment is seen as a function f : a ^ D. 
The constraint relation in this alternative view is then a function from the set of possible 
assignments, D"" , into the set {T, F} where, by convention, the tuples which occur in the 
constraint relation are those which map to T. It follows that any assignment to the set of 
all variables is allowed by ((t, p) when its restriction to a is mapped to T by p. 

Definition 2.2. For any function f : X ^ Y and S d X, the notation f\s means the 
function with domain S satisfying f\s{x) = f{x) for all x & S. 

Given a set V of variables and a domain D, a constraint in functional representation 
is a pair {a, p) where a (Z V and p : t— > {T, F}. A CSP instance in functional repre- 
sentation is a triple {V, D, C) where C is a set of constraints in functional representation. 

A solution (to a CSP instance {V,D,C) in functional representation) is a mapping 
s : V ^ D where, for each {a, p) E C we have p{s\a) = T. 

The functional formulation is clearly equivalent to the relational formulation and we 
will use whichever seems more appropriate throughout the paper. The choice will always 
be clear from the context. 

Relational tract ability of binary CSP 

We will refer to a set of relations P on some finite set D as a constraint language. 

Definition 2.3. Let D be a finite set and let T be a set of relations on D. We define 
CSP(r) to be the set of problems for which every constraint (o", p) satisfies p^T. 
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A constraint language F is said to be tractable if CSP(r') is a tractable class of problems 
for each finite T' C F. It is well-known that the tractability of F can be determined by 
studying the polymorphisms of F [[4] . 

Definition 2.4. Let D be a finite set and let p be a binary relation on D. A k-ary 
polymorphism of p is a function f : i— t- D satisfying 

Vxi,...,Xfc e p, (/(xi[l],...,Xfc[l]),/(xi[2],...,Xfc[2])) G p. 

It is known that the existence of a non-trivial polymorphism is a necessary condition 
for a set of relations to give rise to a tractable constraint language [201 SI [1] • Using this 
characterisation, almost all tractable classes of the CSP defined by sets of relations have 
been determined, though establishing the full dichotomy still remains an open problem. 

Structural tractability of binary CSP 

Structural tractability considers the classes of problems defined by placing restrictions on 
the set of constraint scopes, but which allow arbitrary constraint relations. For simplicity, 
and as this is the focus of this paper, we restrict our attention to binary CSPs. In this case, 
the set of constraint scopes defines the constraint graph whose vertices are the variables 
and whose edges are the set of scopes of constraints whose relation is not complete (i.e. the 
Cartesian product D^). All definitions and concepts extend naturally to non-binary CSPs. 
The key property here is the tree width of the constraint graph. 

Definition 2.5. Let G be a graph. A tree decomposition of G is a pair (T, X), where T is 
a tree and X is a mapping that associates with every node t G V{T) a set Xt C V{G) such 
that for every v G V{G) the set {t G V{T) \ v G Xt} is connected, and for every e G E{G) 
there is at E V{T) such that e G Xt. 

The width of a tree decomposition (T, X) is max{|Xt| — 1 | t G V{T)}. The tree width 
of a graph G, denoted tw{G), is the minimum of the widths of all tree decompositions of 
G. 

The following theorem is classical pT| [T3] . 

Theorem 2.6. Let P be a CSP. If the constraint graph of P, Gp, has tw{Gp) < k, then 
we can solve P in time 0{nd^^^). 

What is more, under reasonable technical assumptions, there is no property of Gp which 
gives rise to a larger tractable class of CSPs. This establishes a dichotomy for structural 
tractability of binary CSPs. A similar result has been obtained for CSPs of higher arity. 
See P [m [TBI [23] for more details. 
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3 Forbidden patterns in CSP 



In this paper we explain how we can define classes of CSP instances by forbidding the 
occurrence of certain patterns. A CSP pattern is a generalisation of a CSP instance. In a 
CSP pattern we define the relations relative to a three- valued logic on {T, F,U}, meaning 
that the pattern is simply the set of CSP instances in which each undefined value U is 
replaced by either T or F. Forbidding a CSP pattern is equivalent to simultaneously 
forbidding all these instances as sub-problems. 

Definition 3.1. We define a three-valued logic on {T, F,U}, where U stands for unknown 
or undefined. The set {T, F,U} is partially ordered so that U < T and U < F but T and 
F are incomparable. 

Let D be a finite set. A k-ary three-valued relation on D is a function p : i— )■ 
{T,F, U}. Given a pair of k-ary three-valued relations p and p' , we say p realises p' if 

Vx e D\p{x) > p' . 

Definition 3.2. A CSP pattern is a triple x = {V, F),C) where: 

• V is the set of variables. 

• D is the domain. 

• C is a set of constraint patterns. Each constraint pattern c E P is a pair c = {a, p), 
where o" C 1/, the scope a of c, is a set of variables and p : i— )■ {T, F,U} is the 
three-valued relation (in functional representation) of c. 

The arity of a CSP pattern x is the maximum arity of any constraint pattern {a, p) of x- 

We will sometimes define p as a partial function from D" to {T, F}; the values for 
which p is undefined are those which are mapped to U. For simplicity of presentation, we 
assume throughout this paper that no two constraint patterns in C have the same scope 
(and that, in the case of CSP instances, that no two constraints have the same scope). We 
will represent binary CSP patterns by simple diagrams. Each oval represents the domain 
of a variable, each dot a domain value. The tuples in constraint patterns with value F are 
shown by dashed lines, those with value T by solid lines and those with value U are not 
depicted at all. 

Contexts 

By further adding simple structure to the domains and variable sets of patterns, we are 
able to make the notion of patterns more specific, and so we can capture larger, and 
more interesting, tractable classes. Contexts such as these have been used in the past to 
capture tractable classes. For example, when the domain is totally ordered we can define 
the tractable max-closed class [21J, and when we have an independent total order for the 
domain of each variable we can capture the renamable Horn class [T6] . 
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The weakest such context that we will consider only allows us to say when two variables 
are distinct. A pattern with such a context will be called flat. We give a general definition 
of context, but in this paper the only contexts we require are total orders on the variable 
set or the domain. 

Definition 3.3. A CSP context is a set of relational structures Vt on the product of the 
variable set and domaii^. 

A CSP pattern {V, D,C) is considered in context Q by associating it with a structure 
u E Q for appropriately-sized variable set and domain. 

Let {V, D) and {V, D') be in context VL, with uj andu' the elements ofVt giving structure 
to the sets V ^ D and V x D' , respectively. A contextual homomorphism is an Vt- 
structure preserving function F : V ^ D \^ V ^ D' , i.e. for each {u,a), {v,b) E V x D, 
{{u, a), {v, h)) G oj implies {F{u, a), F{v, b)) G u' . 

Definition 3.4. Two CSP patterns are compatible if they are considered in the same 
context. 

Thus, for example, two CSP patterns with totally-ordered domains are compatible even 
if the domain sizes are different. In this contextual homomorphism between the 

two patterns must preserve the domain ordering. 

Patterns, CSPs and occurrence 

A CSP instance is just a CSP pattern in which the three- valued relations of the constraint 
patterns never take the value U. That is, we decide for each possible tuple whether it is in 
the relation or not. Furthermore, in a CSP instance, for each pair of variables we assume 
that a constraint exists with this scope; if no explicit constraint is given on this scope, then 
we assume that the relation is complete, i.e. it contains all tuples. This can be contrasted 
with CSP patterns for which the absence of an explicit constraint on a pair of variables 
implies that the truth value of each tuple is undefined. 

In order to define classes of CSP instances by forbidding patterns, we require a formal 
definition of an occurrence (containment) of a pattern within an instance. We define the 
more general notion of containment of one CSP pattern within another pattern. Informally, 
the names of the variables and domain elements of a CSP pattern are inconsequential and 
a containment allows a renaming of the variables and the domain values of each variable. 
Thus, in order to define the containment of patterns, we firstly require a formal definition of 
a renaming. In an arbitrary renaming two distinct variables may map to the same variable 
and two distinct domain values may map to the same domain value. However, we do not 
allow distinct variables vi,V2 to map to the same variable if vi,V2 belong to the scope of 
any binary constraint (otherwise this binary constraint could not be correctly represented 
on a scope consisting of a single variable). 

^We tacitly assume that a context contains at least one structure for every combination of sizes of 
variable set and domain. 
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A domain labelling of a set of variables is just an assignment of domain values to those 
variables. Variable and domain renaming induces a mapping on the domain labellings 
of scopes of constraints: we simply assign the renamed domain values to the renamed 
variables. There is a natural way to extend this mapping of domain labellings to a mapping 
of a constraint pattern: the truth-value of each mapped domain labelling is the same as 
the truth-value of the original domain labelling. However, it may occur that two domain 
labellings of some scope map to the same domain labelling, so instead the resulting value 
is taken to be the greatest of the original truth-values. (In order for this process to be well- 
defined, if two domain labellings of a constraint are mapped to the same domain labelling, 
then their original truth-values must be comparable.) This leads to the following formal 
definition of a renaming which is the first step towards the definition of containment. 

Definition 3.5. Let x = {V, D, C) and x' = {V\ D', C) he compatible CSP patterns. 

We say that x' is a renaming of x if there exist a variable renaming function s -.V ^ 
V and a domain renaming function t : V x D D' that satisfy: 

• If s{vi) = s{v2) for distinct variables Vi, V2, then there is no constraint pattern {a, p) G 
C with vi,V2 G a and p a non-trivial relation (a function which is not identically equal 
to U). 

• F : V X D V X D' defined by F{{v, a)) = {s{v), t{v, a)) is a contextual homomor- 
phism. 

• For each constraint pattern {a, p) G C , for any two domain labellings £, i' G D"^ for 
which F{£) = F{i'), we have that p{i) and p{£') are comparable. 

. C = {{s{a),p') I (a,p) G C}, where p\f) = max{p(£) | F{£) = f}, for each f : 
s{a) ^ D. 

We will use patterns to define sets of CSP instances by forbidding the occurrence (con- 
tainment) of the patterns in the CSP instances. In this way we will be able to characterise 
tractable subclasses of the CSP. Informally, a pattern x is said to occur in a CSP instance 
P if we can find variables in P corresponding to the variables of x, such that there is a 
constraint in P that realises each constraint pattern in x- We will now formally define 
what we mean by a pattern occurring in another pattern (and by extension, in a CSP 
instance). 

Definition 3.6. We say that a CSP pattern x occurs in a CSP pattern P = {V, D, C) (or 

that P contains x), denoted x — )■ P, if there is a renaming (V, D, C) of x where, for every 
constraint pattern {a, p') G C there is a constraint pattern {cr, p) E C and, furthermore, p 
realises p' . 

Example 3.7. This example describes three simple containments. Consider the three 
constraint patterns. Pattern [T](i)-(iii). These constraint patterns occur in, or are contained 
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Pattern 1 




in, Pattern [2] by the contextual homomorphisms Fi, F2, and -F3, respectively, which we will 
now describe. 

Fi is simply a bijection. Although the patterns are different, this is a valid containment 
of Pattern (H^i) into Pattern [2] because the three- valued relation of Pattern [2] is a realisation 
of the three- valued relation in Pattern [T]^i) : we are replacing (6, d) —> U hj {b, d) — )■ F. 

F2 maps (x, a), {x, b), and {y, c) to themselves, and maps both {y, d) and (y, d') to {y, d). 
This merging of domain elements is possible because the three-valued constraint relation of 
Pattern [U^ii) agrees on tuples involving the assignments (y, d) and (y, d') and, furthermore, 
the restriction of the three- valued relation of Pattern [Tt^ii) to either of these two assignments 
is equivalent to the three- valued constraint relation of Pattern |2) (6, d) — t- F and (a, d) — > T. 

Finally, F3 maps (y, c) and {y, d) to themselves, and maps (x, a) and {z, b) in Pat- 
tern [T|^iii) to (x, a) and (x, 6), respectively, in Pattern [2J As before, this merging of variables 
is possible because the three-valued relations agree. □ 

Before continuing we need to define what we mean when we say that a class of CSP 
instances is definable by forbidden patterns. 

Definition 3.8. Let C he any class of CSP instances. We say that C is definable by 
forbidden patterns if there is some context VL and some set of patterns X for which the 
set of CSP instances in which none of the patterns in X occur are precisely the instances 
in C. 



10 



Notation: Let A" be a set of CSP patterns with maximum arity k. We will use CSP(A') 
to denote the set of CSP instances with arity at most k (compatible with X) in which 
no element x G occurs. When X is a. singleton {x} we will allow CSP(x) to mean 

csp(M)- 

Tractable Patterns 

In this paper we will define, by forbidding certain patterns, tractable subclasses of the 
CSP. Furthermore, we will give examples of truly hybrid classes (i.e. not definable by 
purely relational or purely structural properties). 

Definition 3.9. A set of patterns X is intractable if CSP{X) is NP-hard. It is tractable 

if there is a polynomial-time algorithm to solve CSP(A'). A single pattern x is tractable 
(intractable) if {x} is tractable (intractable). 

It is worth observing that classes of CSP instances defined by forbidding patterns do 
not have a fixed domain. Recall, however, that CSP instances have finite domains. 

We will need the following simple lemma for our proofs of intractability results in later 
sections of this paper. 

Lemma 3.10. Let X and T be sets of compatible CSP patterns and suppose that for every 
pattern r & there is some pattern x ^ f^'^ which j-t. Then C'SP{X) C CSP(T). 

Proof. Let P G CSP (A*), so Xt^-P for each x ^ ■ Then we cannot have r— t-P for any 
T E T, since this would imply that there exists some x G such that x^ ^ P 
hence that x^P- Hence, P G CSP(r). □ 

Corollary 3.11. Let X and T be sets of compatible CSP patterns and suppose that for 
every pattern r eT , there is some pattern x G A" for which % — > r. 

We then have that CSP(T) is intractable ifCSP{X) is intractable and conversely, that 
CSP (A") is tractable whenever CSP(T) is tractable. 

Finally, we give a very simple example of a tractable pattern. This is an example of a 
negative pattern since the only truth-values in the relations are F and U. We will use the 
notation NEQ(t'i, . . . , v^) to denote the fact that the variables Vi, . . . ..v^ are all distinct. 

Example 3.12. Consider the pattern given as Pattern[31 This defines a class of CSPs which 
is trivially tractable. We can apply a pre-processing step consisting of first establishing 
arc consistency, and then assigning value c to variable x (and eliminating the variable x) if 
this assignment is consistent with all remaining assignments to other variables. Forbidding 
Pattern [3] ensures that after this pre-processing step there are no paths of length greater 
than 2 in the constraint graph. Thus, any problem forbidding Pattern [3] can be decomposed 
into a set of independent sub-problems, each of maximum size 2. □ 
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Pattern 3 A very simple negative pattern. 



V 




NEQ(w,«;,x), c^c' 



Relational and structural tractability as forbidden patterns 

The following examples show certain strengths of this notion for defining tractable classes. 
First, in Example 13.131 we show that forbidden patterns properly generalise polymor- 
phisms. Then, in Example I3.14[ we show that we can define the class of tree-structured 
CSPs by a single forbidden pattern. 

Example 3.13. Let (D, <) be any totally ordered domain. A binary relation p over D is 
said to be max-closedif, for any tuples (a, b) , {a', b') G p we have that (max(a, a'), max(fe, b')) G 
p. It is well known that the class of CSP instances whose relations are binary max-closed is 
tractable [21]. We can define the class of max-closed CSPs as those forbidding the following 
pattern (Pattern Hj): 

• CSP context: the variable set is unstructured and the domain is totally ordered. 

• Variables: {x,?/}, where x^y. 

• Domain: The ordered set {0, 1} with < 1. 

• A single constraint pattern with scope {x, y} and three-valued relation: 

{x 0, y i-> 0} t-)- f/ 
{x^O,y^l}^T 
{x^l,y^O}^T 
{s t-). 1, y h-). 1} t-). F 



In this pattern, the context specifies that x ^ y and < 1, so we have limited the contextual 
homomorphisms to those that map x and y to distinct variables and and 1 to ordered 
domain values. Thus, saying that a pattern MAX2 is forbidden in a CSP instance P is 
equivalent to saying there is no constraint allowing a pair of tuples (a, b) and (a', b'), where 



12 



a < a' and b' < b, such that {a',b) is disallowed; this is equivalent to saying that every 
constraint must be max-closed. □ 



Pattern 4 The MAX2 pattern. 




X y 

Context: x y, < 1 



The set of max-closed relations are also known as the relations which are closed under 
the polymorphism max(a;,?/). 

Recall from Definition 12.41 that, given a finite set D and a binary relation p on D, a 
/c-ary polymorphism of p is a function / : D'^ i— ^ D satisfying 



"ixi, . . . ,Xk e p, f{xi, 



,Xk) e p, 



where f{xi,...,Xk) = {f{xi[l],...,Xk[l]),f{xi[2],...,Xk[2])). Clearly, we can define the 
set of relations which have a particular k-aij polymorphism / as the set of relations forbid- 
ding a particular set of patterns, namely those which allow k tuples xi, . . . ,Xk but which 
disallow f{xi, . . . ,Xk). Thus, every class of binary CSPs defined by having a particular 
polymorphism can be defined using forbidden patterns. 

We now turn our attention to structural tractability. In Example 13.141 below we show 
that a forbidden pattern can capture the class of CSPs with tree width 1. 



Pattern 5 Tree structure pattern (Tree) 




Example 3.14. Consider the pattern Tree, given as Pattern O We will show that the 
class CSP(Tree) is exactly the set of CSPs whose constraint graph is a forest (i.e. has 
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tree width 1). First, suppose P G CSP(Tree). Then, there exists some ordering tt = 
{vi, . . . , Vn) such that each variable shares a constraint with at most one variable preceding 
it in the ordering. On the other hand, suppose P is a CSP whose constraint graph is a 
tree. By ordering the vertices according to a pre-order traversal, we obtain an ordering 
in which each variable shares a constraint with at most one variable preceding it in the 
ordering (its parent); thus, P G CSP (Tree). 

□ 

4 On characterising tractable forbidden patterns 

In relational tractability we can define a maximal tractable sub-problem of the CSP prob- 
lem. Such a class of relations is maximal in the sense that it is not possible to add even 
one more relation to the set without sacrificing tractability. 

In the case of structural tractability the picture is less clear, since here we measure 
the complexity of an infinite set of hypergraphs (or, more generally, relational structures). 
We obtain tractability if we have a bound on some width measure of these structures. 
Whatever width measure is chosen we have a containment of the class with width bounded 
by k inside that of the class of width bounded by A; + 1 and so no maximal class is possible. 
Nevertheless, for each k there is a unique maximal class of structurally tractable instances. 

In this section, we show that in the case of forbidden patterns the situation is similar. 

Definition 4.1. Let x = {V, D, C) and r = {V, D', C) he any two flat CSP patterns. We 
can form the disjoint unions l/uy' and Dli^D' . Now, extend each constraint pattern in C 
to he over the domain DUD' hy setting the value of any tuple including elements of D' to 
he U , and extend similarly the constraint patterns in C . In this way we can define Cl^C 
and then we set the disjoint union of x CLnd r to he xUr = {VliiV, DliJD', CliiC'). 

Lemma 4.2. Let x o,nd r he flat hinary CSP patterns. Then 

CSP(x)UCSP(r)CCSP(3eCF). 
Moreover, we have that CSP(xWr) is tractahle whenever CSP (x) and CSP(r) are tractahle. 
Proof. We begin by showing the strict inclusion 

CSP(x)UCSP(r)CCSP(^). 

That the inclusion holds follows directly from Lemma 13.101 To see that the inclusion is 
strict, observe that x t occur in a CSP pattern whose domain is the disjoint union of 
those for x and r but whose variable set has size equal to the larger of the two original 
variable sets. Any CSP instance containing this pattern is neither in CSP(x) nor in CSP(r). 
However, we can construct a CSP instance containing this pattern which is contained 
in CSP(xUt), as the assumption that all variables are distinct means that x^^ is not 
contained in this pattern. 
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Suppose P G CSP(xWr). If P G CSP(x) U CSP(r) then P can be solved in polynomial 
time, by the tractability of CSP(x) and CSP(r). 

So we may suppose that P- Choose a particular occurrence of x in -P and let a 
denote the set of variables used in the containment. Consider any assignment t : a ^ 
D. Let Pt denote the problem obtained by making this assignment and then enforcing 
arc-consistency on the resulting problem. This corresponds to adding some new unary 
constraints to P. 

We will show that if r occurs in Pt then xUr must occur in P. To see this, observe that 
any containment of r in Pm naturally induces a containment of r in P that extends to a 
containment of xUr in P, by considering the occurrence of x ct- Thus, we can conclude 
that Pt G CSP(t), and so can be solved in polynomial time. 

By construction, any solution to Pt extends to a solution to P by adding the assignment 
t to the variables a. Moreover, every solution to P corresponds to a solution to Pt for some 
t : a ^ D. Since the size of x is fixed, we can iterate over the solutions to x i^i polynomial 
time. If P has a solution, then we will find it as the solution to some Pj. If we find that 
no Pt has a solution, then we known P does not have a solution. Thus, since we can solve 
each Pt in polynomial time, we can also solve P in polynomial time. □ 

Corollary 4.3. No tractable class defined by forbidding a flat pattern is maximal. 

Proof. Let x be any tractable flat pattern. Consider the pattern defined by the disjoint 
union of two copies of x, which we denote x^^-*. By Lemma [4.21 we have that CSP(x(^)) is 
tractable but also that 

CSP(x)CCSP(^), 

and hence CSP(x) is not a maximal tractable class. □ 

It follows that we cannot characterise tractable forbidden patterns by exhibiting all 
maximal classes defined by tractable forbidden patterns. Indeed, a consequence of Lemma 14. 21 
is that we can construct an infinite chain of patterns, such that forbidding each one gives 
rise to a slightly larger tractable class. Naturally, if we place an upper bound on the size of 
the patterns then there are only finitely many patterns that we can consider, so maximal 
tractable classes defined by forbidden patterns of bounded size necessarily exist. 

For the moment, we are not able to make a conjecture as to the structure of a dichotomy 
for general forbidden patterns. Nonetheless, in Section [6|, by restricting our attention to a 
special case, forbidden negative patterns, we are able to obtain interesting general results. 

5 Tractable forbidden patterns on three variables 

In the previous section, we showed that we need to place restrictions on the size of the 
forbidden patterns if we want to establish any sort of dichotomy. Since forbidden patterns 
on two variables only place restrictions on the set of constraint relations that can occur in 
an instance, the first interesting hybrid classes occur when we consider three variables. In 
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Pattern 6 Broken triangle pattern (btp) 




b 



a 




Vi <V2 < V3 



this section we present two hybrid tractable classes of binary CSP instances characterised 
by forbidden patterns on three variables. 

The first example, already introduced in [6], is known as the broken-triangle prop- 
erty (btp). In order to capture this class by a forbidden pattern we have to work in 
a context where the set of variables is totally ordered. In this case pattern containment 
must preserve the total order. We can define the broken-triangle property by the forbidden 
pattern btp, shown in Pattern [6l The following result was proved in [6]. 

Theorem 5.1. Let btp be the pattern in Pattern\^ The class of CSP instances CSP(btp) 
can be solved in polynomial time. 

It is easy to see that Tree (shown in Pattern [5]) occurs in btp (with some truth- values 
U being changed to T). It follows from Lemma ISTTOl that CSP (Tree) C CSP (btp). Hence 
the class CSP (btp) includes all CSP instances whose constraint graph is a tree. However, 
CSP (btp) also includes CSP instances whose constraint graph has tree width r for any 
value of r: consider, for example, a CSP instance with r + 1 variables and an identical 
constraint between every pair of variables which simply disallows the single tuple (0,0). 

For any tractable forbidden pattern relative to a context with an order on the variables, 
we can obtain another tractable class by considering problems forbidding the pattern in 
a flat context. The class obtained is (possibly) smaller because it is easier to establish 
containment of the fiat pattern. In the particular case of the broken-triangle property, we 
obtain a strictly smaller tractable class by forbidding Pattern |6] for all triples of variables 
^i)t'2,t^3 irrespective of their order. We can easily exhibit a CSP instance that shows this 
inclusion to be strict: for example, the 3-variable CSP instance over Boolean domains 
consisting of the two constraints Vi = V2, Vi = with the variable ordering fi < ^2 < ^3. 

Our second example is a generalisation of the well-known tractable class of problems, 
AllDifferent+unary [231 [29]: an instance of this class consists of a set of variables V, 
a set of arbitrary unary constraints on V , and the constraint v ^ w defined on each pair of 
distinct variables f , G We define a more general class containing every such instance 
using the forbidden pattern shown in Pattern [7], which we call Negtrans. Forbidding 
this pattern insists that disallowing tuples is a transitive relation, i.e. if {{v , a) , {x , b)) 
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and {{x , b) , {w , c)) are disallowed then {{v , a) , {w , c)) must also be disallowed. By the 
transitivity of equality, Pattern [7] does not occur in any binary CSP instance in the class 
AllDifferent+unary. 

Pattern 7 Negative transitive pattern (Negtrans) 



V 




NEQ{v,w,x) 



Theorem 5.2. Let Negtrans denote Pattern^ The class of CSP instances forbidding 
Negtrans can be solved in polynomial time. 

Proof. We prove this by a straightforward reduction to the well-known tractable problem 
AllDifferent+unary P I2l]. 

Let P = {V,D,C)he& binary CSP in which Negtrans does not occur, and let n = \ V\ 
and d = \D\. We define the graph Gp which we call the inconsistency graph of P. The 
vertices of Gp are the pairs {v, c) where v is a. variable in P and c G -D is allowed by the 
unary constraint on v. The edges of Gp are the pairs of vertices {{v^a) , {w,b)} of Gp 
for which there exists a constraint {{v, w) , p) with scope {v, w) such that (a, b) ^ p. (The 
inconsistency graph is the microstructure complement [22] without edges between pairs of 
assignments to the same variable.) 

We first prove that, for any connected component H of Gp, either 

• The subgraph of Gp induced by is a complete multipartite graph with edges 
{{v, a) , {w, b)} for each (f , a) , {w, b) E H satisfying v w {in this case, we call H an 
inconsistency clique), or 

• H meets exactly two variables of P: \{v \ {v, a) G H}\ = 2. 

Any connected component H that meets only one variable is a trivial inconsistency 
clique. Consider a component H that meets at least three distinct variables. To show that 
H is an inconsistency clique we have only to show that the two end-points of every path of 
length three meeting three variables and of every path of length four meeting two variables 
are connected. (The length of a path is the number of vertices on the path). 

Let {v, a) , {x, c) , {w, b), he a path of length three in H, where v, x, w are distinct vari- 
ables. Since Negtrans does not occur in P, we must have a constraint {{v,w) , p) with 
(a, b) ^ p. 
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Let {v, a) , {w, a') , {v, b') , {w, b) be a path of length four in H. Since H is connected and 
comprises at least three variables, there is some other vertex (x, c) with x ^ {v,w} which 
is connected to this path. Since this creates paths of length three through three variables 
we can repeatedly use the argument given above to show that {x, c) is adjacent to each of 
the four vertices on the path. Finally, since it is adjacent to both (t>, a) and {w, b) we use 
the argument one last time to show that these two vertices are adjacent. 

We can now demonstrate the reduction to AllDifferent+unary. First we can 
identify all connected components of Gp in polynomial time. For each component H that 
is not an inconsistency clique, H meets exactly two variables v, w and there is some pair of 
vertices {v, a) and {w, b) which are not adjacent and which are adjacent to no vertex {x, c) 
for any other variable x. We can therefore make the assignments v = a,w = b and remove 
from Gp all vertices corresponding to assignments to these variables. We denote by V the 
remaining set of variables after removing each such pair of variables from P. Note that we 
have an assignment to every variable not in V that is consistent with any assignment to 
the variables of V. 

Let Py denote the resulting CSP instance on variables V and Gp^, the corresponding 
inconsistency graph. The components Hi, . . . , Hm of Gp^, are all inconsistency cliques. 
For each component Hi and each variable v we define Hi{v) = {{w, c) & Hi \ w = v}. 

Consider a CSP P' with variables V and domain {1, . . . ,rn}. Apply the unary con- 
straint on each variable v of P' given by the unary relation {{i) \ Hi{v) ^ 0}. Finally apply 
the AllDifferent constraint over all variables of P'. 

No solution to Py can contain two assignments from the same component of Gp^,. 
Therefore, to every solution s to Py there is a corresponding solution s' to P': choose 
s'{v) = i where {v,s{v)) G Hi. 

Conversely, any solution s' to P' corresponds to a solution s to Py by choosing s{v) to 
be any value in for each v &V. 

The time taken to obtain Gp^, from P is proportional to the total number of disallowed 
tuples in P; hence, the time taken is 0{\C\(P). Solving P' is equivalent to finding a perfect 
matching in a bipartite graph with \V'\ + m vertices and up to |l^'|m edges. Using the 
Fibonacci heap data structure, we can find a perfect matching in a bipartite graph with 
vertices and M edges in time 0(A^log(A) + NM) [12]. Thus, we can find a solution to 
P' in time 0{{n + m)^ log(n + m) + (n + m)nm) . The maximum value of m occurs when 
each component of Gp contains exactly three assignments, so we will always have m < 
Thus, under the reasonable assumption that d < n, we can solve P' in time 0{n^(P). Since 
\C\ is O(n^), it follows that P can be solved in time 0{n^(P). □ 

It has recently been shown [7] that the tractable class defined by forbidding Pattern [7] 
(Negtrans) can be extended to soft constraint problems but that this is not the case for 
the class of problems obtained by forbidding Pattern [6] (btp) (in the sense that the class 
becomes NP-hard if all unary soft constraints are also allowed). 

Having demonstrated through the btp and Negtrans patterns that forbidding pat- 
terns provides a language which is rich enough to define interesting hybrid tractable classes. 
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we concentrate in the rest of the paper on progress towards characterising tractable for- 
bidden patterns. 

6 Binary flat negative patterns 

In this section we define three particular patterns and one infinite class of patterns. We 
then use these patterns to characterise a very large class of intractable patterns. We prove 
that any finite set of patterns not in this class has a simple structure: one of the patterns 
must contain one of a particular set of patterns, which we call pivots. This means that 
any tractable finite set of patterns must include a pattern in which a pivot pattern occurs. 
Furthermore, we exhibit a class of patterns which are contained in pivots and which we are 
able to prove give rise to a tractable class. We conjecture that pivots are also tractable; if 
this is the case then it implies a simple characterisation of the tractability of finite sets of 
binary fiat negative patterns. 

Definition 6.1. A constraint pattern {(y,p) will be called negative if p never takes the 
value T. A CSP pattern x is negative if every constraint pattern in x is negative. 



Pattern 8 Cycle(6) 




In Definition 16.21 below, we define the concept of a connected negative binary pattern. 
These correspond to negative binary patterns x such that every realisation of x as a binary 
CSP instance has a connected constraint graph. We first generalise the notion of constraint 
graph to CSP patterns. We call the resulting graph the negative structure graph. 

Definition 6.2. Let x be any binary negative pattern. The vertices of the negative struc- 
ture graph G are the variables of x- A pair of vertices is connected in G if and only if 
they form a scope in x whose constraint pattern assigns at least one tuple the value F. We 
say that x is connected if its negative structure graph is connected. 

For example. Pattern IH] (Valency), Pattern [10] (Path) and Pattern[TT](VALENCY+PATH) 
are not connected. Note that a pattern which is not connected may occur in a connected 
pattern (and vice versa). Pattern [8] shows Cycle(6) which is connected. This is just one 
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Pattern 9 Valency 




Context: NEQ{xi,X2,Xs,x[) /\ N EQ^x^.x^^x'^) 



Pattern 10 Path 




Vl V2 V3 Wi W2 W3 



NEQ{vi,V2,V3,wi) A NEQ{wi, 102,103) 



example of the generic pattern Cycle(A;) where k > 2. The only context for CYCLE(fc) is 
that all variables are distinct, except for the special case k = 2 for which the context also 
includes c 7^ c', meaning that Cycle(2) is composed of a single binary constraint pattern 
containing two distinct inconsistent tuples. The following theorem uses these patterns to 
show that most patterns are intractable. 

Theorem 6.3. Let X he any finite set of connected negative binary patterns. If at least 
one o/Cycle(A;) (for some k>2), Valency, Path, or Valency+Path occurs in each 
X G A" then CSP(A:') is intractable. 

Proof. Let A* be a set of connected negative binary patterns and let I be the number of 
variables in the largest member of X. 

Assuming at least one of the four patterns occurs in each x G A", we can construct a 
class of CSPs in which no element of X occurs and to which we have a polynomial reduction 
from the well-known NP-complete problem 3SAT [T3]. 

The construction will involve three gadgets, examples of which are shown in Figure [H 
These gadgets each serve a particular purpose: 

1. The cycle gadget, given in Figure [D^a) for the special case of 4 variables, enforces that 
a cycle of Boolean variables {vi,V2, ■ ■ ■ , Vr) all take the same value. 
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Pattern 11 Valency+Path 




Vl 



©-©-© 

Wi W2 W3 



NEQ{vi,V2,V3), NEQ{wi,W2,W3), and x ^ W2 



2. The clause gadget in Figure [UJ^b) is equivalent to the clause f 1 V 1^2 V V3, since vc has 
a value in its domain if and only if one of the three Vi variables is set to true. We can 
obtain all other 3-clauses on these three variables by inverting the domains of the Vi 
variables. 

3. The line gadget in Figure [T]^c), imposes the constraint Vi =^ V2- It can also be used 
to impose the logically equivalent constraint -1^2 ^ ~^Vi. 

Now, suppose that we have an instance of 3SAT with n propositional variables Xi, . . . , X„ 
and m clauses Ci, . . . , Cm- 

We begin to build a CSP instance \E' to solve this 3SAT instance by using n copies 
of the cycle gadget (Figure [^a)), each with m(£ + 1) variables. For i = 1, . . . ,n, the 
variables along the ith copy of this cycle are denoted by {vl, vf,..., v^^^^^^^). In any solution 
to a CSP instance with these and other constraints, we will have that the variables 
vl , j = 1, . . . , m{i + 1) must all have the same value, di. We can therefore consider each vj 
as a copy of Xj. 

Consider the clause C^. There are eight cases to consider but they are all very similar 
so we will show the details for just one case. Suppose that = V Xj V -iX^. We 
build the clause gadget (Figure [l](b)) with the three Boolean variables being c^, c{, and 
and invert the domain of since it occurs negatively in Cw Then any solution s to our 
constructed CSP must satisfy s(c^) V s(c{,) V ~'s(c^) = T. 

We complete the insertion of into the CSP instance by adding some length i + 1 
line constructions (Figure [It^c)). We connect the cycle gadgets corresponding to Xj, Xj 
and Xfc to the clause gadget for clause Cw since Xj,Xj and Xk occur in C^. We connect 
^w{e+i) since Xj is positive in C^, so s(cj^,) = T is only possible when s{y^'^^^^^') = T, 

for any solution s. Similarly, we connect vj^^'^^^ to c{^. Finally, since X^ occurs negatively 
in Cw, we impose the line constraints in the other direction. This ensures that s(c^) = F 
is only possible when s{v^^^~^^'^) = F. Imposing these constraints ensures that a solution is 
only possible when at least one of the cycles corresponding to variables Xj, X,-, and X^ is 
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^^1 



V2 



(c) 

Figure 1: (a) Making copies of the same variable (fi = V2 = Vs = V4). (b) Imposing the 
ternary constraint vc = ViV V2\/ V3. (c) A hne of constraints which imposes Vi =^ ^2. 
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assigned a value that would make the corresponding literal in true. 

We continue this construction for each clause of the 3SAT instance. Since £ is a constant, 
this is clearly a polynomial reduction from 3SAT. 

We now show that any CSP instance P$ constructed in the manner we have just de- 
scribed cannot contain any pattern in X. We do this by showing that no pattern containing 
CYCLE(k) (for 2 < k < i), Valency, Path, or Valency+Path can occur in the instance. 
This is sufficient to show that the CSP Pij, does not contain any of the patterns in X. 

In the CSP P^, no constraint contains more than one inconsistent tuple. Thus, any 
X € A" for which Cycle(2) — j-x cannot occur in Pij,. Furthermore, Pq; is built from cycles 
of length m{i + 1) and paths of length £ + 1, and so cannot contain any cycles on less than 
£ + 1 vertices. Thus, since £ is the maximum number of vertices in any component of X, 
it follows that no x G A" for which Cycle(A;) — i- x, for any A; > 3, can occur in P^. 

We define the valency of a variable x to be the number of distinct variables which share 
a constraint with x. Suppose Valency — )■ x- For this to be possible we require that there 
is a variable of valency four in x, or a pair of variables of valency three connected by a path 
of length at most £. Certainly P^ has no variables of valency four. Moreover, the fact that 
P^. was built using paths of length £ + 1 means that no two of its valency three variables 
are joined by a path of length at most £. Thus, any x G A* for which Valency—)- x does 
not occur in Pq,. 

Next, consider that case when Path — )■ x- Here x must have two distinct (but possibly 
overlapping) three-variable lines (with inconsistent tuples in these constraints that match 
at domain values) separated by at most £ variables. The only place where inconsistent 
tuples can meet in Px^, is when we connect the line gadget to the cycle gadget. These 
connection sites are always at distance greater than £, so we can conclude that x 7^ -Pi/- 
whenever Path — )■ x- 

Finally, consider the case where Valency+Path — )■ x- Here, x must have one variable 
of valency at least 3 and a path of constraints on three variables with intersecting negative 
tuples, and these must be connected by a path of less than £ variables. As observed above, 
the only places where we can have inconsistent tuples meeting is where the line gadget meets 
the cycle gadget, and there is a path of at least £ variables between each one of these points 
and every other variable of valency 3. Thus, x 7^ Pi' whenever Valency+Path — ?■ x- □ 

It remains to consider which sets of negative binary patterns could be tractable. For 
this, we need to define the pivot patterns, PlvOT(r), which contain every tractable pattern. 

Definition 6.4. Let V = {p}U{vi, . . . , Vr}U{wi, . . . , Wr}U{xi, . . . , Xr} and let D = {a, b}. 
We define the pattern PlvOT(r) = {V, D, Cp U C^U Li Cx), where 

Cp = {{((p, a) , h))} ^ F, {{{p, a) , {w,, b))} ^ F, {{{p, b) , {x,,b))} ^ F} 
Cv = {{{{vi.a) , {vi+ub))} ^ F I 2 = 1, . . .,r - 1} 
Cw = {{{{wi, a) , {wi+i, b))} ^ F \ i = 1, . . . ,r ~ 1} 
Cx = {{{{xi, a) , {Xi+i, b))} ^ F \ i = l,...,r-l} 

See Pattern for an example, Pivot(3). 
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Pattern 12 Pivot(3) 




The following proposition characterises those sets of binary flat negative patterns which 
Theorem 16.31 does not prove intractable. 

Proposition 6.5. Any connected binary flat negative pattern x either contains Cycle(A;) 
(for some k >2), Valency, Path, or Valency t^Path, or it itself occurs in PivoT(r) 
for some integer r < \x\- 

Proof. Suppose x does not contain any of the patterns VALENCY, Cycle(A;) (for any 
k > 2), Path, or Valency+Path. Since Cycle(2)74x, each constraint pattern in x 
contains at most one inconsistent tuple. Recall that the valency of a variable x is the 
number of distinct variables which share a constraint with x. Since x does not contain 
Valency it can only contain one variable of valency three and all other variables must 
have valency at most two. Moreover, since Cycle(A;) -/^ x fo^^ k > 3, the negative structure 
graph of X does not contain any cycles. Then, since x is connected, the negative structure 
graph of X consists of up to three disjoint paths joined at a single vertex. The fact that 
X does not contain Path means there can be at most one pair of intersecting inconsistent 
tuples in x and, moreover, the fact that x does not contain Valency+Path means that 
this intersection must occur on the variable with valency 3, if it exists. It then follows that 
X must occur in PlVOT(r), for some r < □ 

Corollary 6.6. Let X be a set of connected binary flat negative patterns. Then CSP{X) 
is tractable only if there is some x ^ that occurs in PlvoT(r), for some integer r < \x\- 

For an arbitrary (not necessarily flat or negative) binary CSP pattern x, we denote by 
neg(x) the fiat negative pattern obtained from x by replacing all truth- values T by f/ in 
all constraint patterns in x and ignoring the context. For a set of patterns X, neg(A') 
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is naturally defined as the set neg(A:') = {neg(x) : X ^ Clearly CSP(neg(A')) C 
CSP(A'). The following result follows immediately from Corollary 16.61 

Corollary 6.7. Let X be a set of binary patterns such that for each x G A", neg(x) is 
connected. Then CSP{X) is tractable only if there is some x G A* such that neg(x) occurs 
in PlVOT(r), for some integer r < \x\- 



Pattern 13 SepPivot(3) 
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NEQ{p, Vi,V2, V3, Wi,W2, W3, Xi,X2, X3) 



We now define a pattern we call a separable pivot; forbidding this pattern defines a 
subclass of CSP(PlV0T(r)). 

Definition 6.8. Let V = {p} U {vi, . . . , Vr} U {wi, . . . , Wr} U {xi, . . . , Xr} and let D = 

{a, 6, c}. We define the pattern SEPPlvoT(r) = {V, D, Cp U C^, U U C^), where 

Cp = {{{{p, a),{vi,b))}^ F, {{{p, b),{wi,b))}^ F, {{{p, c) , (xi, 6))} ^ F} 
Cv = {{{{vu a) , {vi+i, 6))} F I i = 1, . . . , r - 1} 
Cw = {{{{wi,a) , {wi+i,b))} h-> F I 2 = 1, . . . ,r - 1} 
Cx = {{{{xi, a) , (xi+i, b))}^ F\i = l,...,r-1} 

See Pattern [7g| for an example, SepPivot(3). 

Clearly, SEPPlVOT(r) occurs in PlVOT(r): we take a bijection between corresponding 
variable-value pairs for the Vi, Wi and Xi variables, map both {p,a) and {p,b) in Sep- 
PlvoT(r) to {p,a) in PlvoT(r), and map {p,c) in SEPPlvOT(r) to {p,b) in PlvoT(r). We 
will now show that CSP(SEPPlV0T(r)) is tractable for any fixed r. 

Theorem 6.9. Let r be a fixed integer. CSP(SEPPlV0T(r)) can be solved in polynomial 
time. 
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Proof. By the grid minor theorem of Robertson and Seymour [26] , there exists a function 
/ : N — i- N such that any graph G with tree width tw{G) > f{r) must contain an r x r 
grid as a minor. 

Now SEPPlvOT(r) occurs in any CSP instance whose constraint graph contains a vertex 
that starts three disjoint paths. Certainly, any CSP instance P whose constraint graph 
contains an r x r grid as a minor will satisfy this condition. Hence, P G CSP(SEPPlV0T(r)) 
is only possible when tw{P) < f{r). Since the class of CSP instances with tree width 
bounded above by /(r) is tractable, it follows that CSP(SEPPlV0T(r)) is tractable^ □ 

The following corollary is a direct consequence of Lemma 14.21 
Corollary 6.10. Any disjoint union of SEPPlVOT(r) patterns is a tractable pattern. 

Next, we show that forbidding Pivot(I) gives rise to a tractable class of CSPs. 

Proposition 6.11. CSP(Pivot(1)) can be solved in polynomial time. 

Proof. We will show that every P G CSP(Pivot(1)) can be reduced in polynomial time to 
P' G CSP(Negtrans) such that P has a solution if and only if P' has a solution. Without 
loss of generality, we assume that P is arc-consistent since eliminating domain values (by 
arc consistency) cannot destroy the fact that P is PlV0T(l)-free. 
Suppose Negtrans occurs in P at {u,p,v}: 





P 



Since P does not contain Pivot(I), it follows that p cannot be connected to any variables 
other than u, v in the constraint graph of P. Thus, we can obtain an equivalent CSP Pi by 
eliminating p and tightening the constraint on {u, v} by disallowing any assignment which 
does not extend to an assignment of p. This new CSP is still PlV0T(l)-free but has had the 
occurrence of Negtrans on {u,p, v} eliminated. To see that Pi is PlV0T(l)-free, suppose 
that the pair of assignments {{u,a) , {v,b)) becomes incompatible in Pi after elimination 
of variable p from P. By arc consistency of P, there are values c, d such that the pairs 
((it, a) , {p, c)), ((f , b) , {p, d)) are consistent in P. But, since {{u, a) , {v, b)) cannot be ex- 
tended to an assignment of p in P, this implies that the pairs {{u, a) , {p, d)), {{v, b) , (p, c)) 
are necessarily inconsistent in P. Now, if the inconsistent pair ((m, a) , (f , b)) were part of 
an occurrence of Pivot(I) in Pi, then we could easily obtain an occurrence of Pivot(I) 

■^The best upper bound on the function f{r) is superexponential: 20^''^ [IS]. Thus, Theorem 16.91 does 
not actuaUy provide a practical algorithm for solving problems in CSP(SEPPivOT(r)). 
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in P by replacing a) , (v, h)) by either ((m, a) , (p, d)) or ((t>, 6) , (p, c)) (depending on 
whether it is variable u oi v which is at the centre of the pivot). 

Thus, by repeatedly identifying and eliminating occurrences of Negtrans, we will 
eventually (after the elimination of at most n—2 variables) obtain a CSP P' G CSP (Negtrans). 
By the way we have constructed P', we know that any solution to P' can be extended to 
an assignment on the removed variables. Thus, since we can solve any instance P' G 
CSP(Negtrans) in polynomial time (Theorem 15. 2p . we can solve any instance P from 
CSP(Pivot(1)) in polynomial time. □ 

Proposition 16.111 is important as it gives us a tractable class of CSPs defined by forbid- 
ding a negative pattern which, unlike CSP(SEPPlV0T(r)), contains problems of unbounded 
tree width, and so cannot be captured by structural tractability. As an example of a class 
of CSP instances in CSP(Pivot(1)) with unbounded tree width, consider the n-variable 
CSP instance Pn with domain {1, . . . , ??,} whose constraint graph is the complete graph and, 
for each pair of distinct values i,j G {1, . . . ,r2,}, the constraint on variables Vi,Vj disallows 
a single pair of assignments {{vi,j) , {vj,i)). Since each assignment {vi,j) occurs in a single 
inconsistent tuple, Pivot(I) does not occur in P„, and hence P„ G CSP(Pivot(1)). 

We conjecture that there exists a larger class of tractable problems defined by forbidding 
negative patterns. 

Conjecture 6.12. For a fixed integer r, CSP(PlV0T(r)) can be solved in polynomial time. 

A positive answer to Conjecture I6.12[ taken in conjunction with Corollary 16. 6[ would 
give a dichotomy result for CSPs defined by forbidding a finite set of binary flat negative 
patterns, which we state in the form of a conjecture. 

Conjecture 6.13. Let X be a finite set of connected binary flat negative patterns. Then 
CSP (A") is tractable if and only if there is some x ^ that is contained in PlVOT(r), for 
some integer r < 

7 Conclusion 

In this paper we described a framework for identifying classes of CSPs in terms of forbidden 
patterns, to be used as a tool for identifying tractable classes of the CSP. We gave several 
examples of small patterns that can be used to define tractable classes of CSPs. 

In the search for a general result, we restricted ourselves to the special case of binary 
flat negative patterns. In Theorem l6.3l we showed that CSP(A') is NP-hard if every pattern 
in a set X contains at least one of four patterns (Patterns IHl l9l fTOl andfTTl). Moreover, we 
showed that any connected binary fiat negative pattern that did not contain any of these 
patterns must itself be contained within a special type of pattern called a pivot. Hence, the 
presence of a pivot is a necessary condition for pattern x to be tractable. We were able to 
show that another pattern, which we call separable pivot, occurs in the pivot pattern and 
defines a tractable class. Hence, separable pivots define a tractable subclass of the class 
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defined by pivots. We conjecture that tractability extends to the whole class of problems 
defined by pivots. We proved tractability for pivots of size 1. 

The main open problem is the resolution of the tractability of pivots of any fixed size 
r. Beyond this, it will be interesting to see what new tractable classes can be defined 
by more general binary patterns or by non-binary patterns. In particular, an important 
area of future research is determining all maximal tractable classes of problems defined by 
patterns of some fixed size (given by the number of variables or the number of variable- value 
assignments). 
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